Question Answering Module Leveraging Heterogeneous Datasets
نویسندگان
چکیده
Question Answering has been a well-researched NLP area over recent years. It become necessary for users to be able query through the variety of information available - it structured or unstructured. In this paper, we propose module which a) can consume data formats heterogeneous pipeline, ingests from product manuals, technical forums, internal discussion groups, etc. b) addresses practical challenges faced in real-life situations by pointing exact segment manual chat threads solve user c) provides segments texts when deemed relevant, based on and business context. Our solution comprehensive detailed pipeline that is composed elaborate ingestion, parsing, indexing, querying modules. capable handling plethora sources such as text, images, tables, community flow charts. studies performed business-specific datasets represent necessity custom pipelines like proposed one several real-world document question-answering.
منابع مشابه
Investigating Embedded Question Reuse in Question Answering
The investigation presented in this paper is a novel method in question answering (QA) that enables a QA system to gain performance through reuse of information in the answer to one question to answer another related question. Our analysis shows that a pair of question in a general open domain QA can have embedding relation through their mentions of noun phrase expressions. We present methods f...
متن کاملOpen-Domain Question Answering on Heterogeneous Data
Open-domain question answering is a hot research topic in recent years. QA Track held by NIST has offered a new evaluation on this topic. However, its target is aimed at plain text collection only. This paper focuses on QA for heterogeneous data, including plain text, summaries, tabular data, and video programs. Discussion of what necessary adaptation could be done to deal with such kind of dat...
متن کاملLeveraging Video Descriptions to Learn Video Question Answering
We propose a scalable approach to learn video-based question answering (QA): to answer a free-form natural language question about the contents of a video. Our approach automatically harvests a large number of videos and descriptions freely available online. Then, a large number of candidate QA pairs are automatically generated from descriptions rather than manually annotated. Next, we use thes...
متن کاملFora: Leveraging the Power of Internet Communities for Question Answering
This paper introduces a system for searching question answer pairs automatically extracted from the discussions in internet communities. The system, named Fora, aggregates discussions from multiple forums and newsgroups in the same domain, automatically extracts question answer pairs from the data, and provides searches of the question answer pairs. The system also offers expert search, query s...
متن کاملLeveraging Community-Built Knowledge for Type Coercion in Question Answering
Watson, the winner of the Jeopardy! challenge, is a state-of-the-art open-domain Question Answering system that tackles the fundamental issue of answer typing by using a novel type coercion (TyCor) framework, where candidate answers are initially produced without considering type information, and subsequent stages check whether the candidate can be coerced into the expected answer type. In this...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International journal on natural language computing
سال: 2021
ISSN: ['2278-1307', '2319-4111']
DOI: https://doi.org/10.5121/ijnlc.2021.10601